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ON A VARIANT OF THE LARGE SIEVE 

BEN GREEN 



Abstract. We introduce a variant of the large sieve and give an example of its 
use in a sieving problem. Take the interval [TV] = {!,..., N} and, for each odd prime 
p ^ vN, remove or "sieve out" by all n whose reduction n(mod p) lies in some interval 

06 ■ /p C Z/pZ of length {p- 1)/2. Let A be the set that remains: then \A\ <j N^/^+°W , a 

^^ , bound which improves slightly on the bound of \A\ ^ N^^^ which results from applying 

the large sieve in its usual form. This is a very, very weak result in the direction of 
a question of Helfgott and Venkatesh, who suggested that nothing like equality can 
occur in applications of the large sieve unless the unsieved set is essentially the set of 
values of a polynomial (e.g. A is the set of squares). 

Assuming the "exponent pairs conjecture" (which is deep, as it implies a host of 
classical questions including the Lindelof hypothesis. Gauss circle problem and Dirich- 
let divisor problem) we can improve the bound to \A\ <C N°^^' . This raises the worry 

Li^ , that even reasonably simple sieve problems are connected to issues of which we have 

>-^ ' little understanding at the present time. 

a: 

1. Introduction 

(N' 

^ I The large sieve is, in its purest form, the following analytic inequality. 



Theorem 1.1 (Large sieve). Suppose that N ^ 1 is an integer and that 6 G (0,1) 



cn 

o 

10 ' Suppose that the points 9i, ... ,9k G M/Z are 6 -separated, that is to say 

O' \0^-0^>S (LI) 

00 ! 

O ■ whenever 1 ^ i < j ^ k. Let (a„)ne[Af] be any sequence of complex numbers. Then 

j^ ■ i=l ne[N] ne[N] 

As it stands, this inequality may be thought of as a kind of approximate Bessel's 
inequality; the separation condition (II .ip acts to ensure that the exponentials e{n6i), 
i = 1, . . . ,k, are roughly orthogonal over n G [A^]. The rather clean form of (11.21) was 
proved by Montgomery and Vaughan [12] and to prove it somewhat careful arguments 
are needed. It is much easier to establish a weaker inequality in which the right-hand 
side is replaced by (say) 8(A^ + 5~^) ^ |anP; at the level of the discussions in this paper, 
this is just as good. 

The large sieve gets its name from the fact that (11.21) may be used to give bounds for 
certain sieving problems, of which the following is an example. 
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Problem 1.2 (Sieve problem). Let N ^ 1. Suppose that for each off prime p ^ viV one 
is given a set Sp C Z/pZ with \Sp\ = (p — l)/2. Let A C {1, . . . , N} be the set obtained 
by "sieving out" the residue classes Sp for each prime, that is to say by removing from 
[N] each n for which n(mod p) lies in Sp for some p. What upper bounds can one place 
on A? 

Remark. In actual fact one may consider far more general settings. The sieving limit 
X = yN may be reduced, one might sieve only by a subset of the primes p ^ X rather 
than by all primes, and the size of Sp might vary less regularly with p. 

The large sieve gives, by an argument of Montgomery [10] , the following result con- 
cerning Problem II .21 

Theorem 1.3 (Large sieve bound). In Problem ] l.B we have the bound \A\ ^ CyN for 
some absolute constant C . D 

Theorem 11.31 is essentially sharp. Indeed if one takes Sp to consist of the quadratic 
non-residues for each prime p then the unsieved set A contains all of the squares less 
than or equal to N, and so \A\ ^ (1 — o{l))yN. It is, however, very hard to think of 
an essentially different example giving a comparable lower bound. As a result Helfgott 
and Venkatesh [7] and independently Croot and Elsholtz (cf. [2, Problem 7.4]) were 
motivated to make the beautiful guess that any set A of size close to A/iV which survives 
the sieving process in Problem 11.21 is essentially the set of values of some quadratic 
polynomial. 

Conjecture 1.4 (Inverse conjecture for the large sieve). Suppose that the set A is 
the result of the sieving process in Problem \1.S\ and that \A\ ^ jyo.499^ Then all but 
(9(^jYo(i)^ pQljilg qJ j\ g^g contained in the set of values of a quadratic polynomial f{n) = 
an^ + bn + c. 

Remark. In fact it is not unreasonable to think, as Croot, Elsholtz, Helfgott and 
Venkatesh did, that much more should be true. One might consider more general 
sieving situations than the one in Problem 11.21 and even under an assumption as weak 
as \A\ ^ A^^ the set A should have some fairly rigid "algebraic" structure. 

Conjecture 11.41 seems to be of interest in its own right. Furthermore suitable variants 
of it ought to have applications, for example to Ostmann's Inverse Goldbach Problem 
as considered in a series of papers by Elsholtz [3] . 

We are not able to establish Conjecture 11.41 or anything close to it. The aim of 
this paper is to develop ideas which go somewhat beyond the large sieve in using the 
structure of the residue classes Sp rather than merely their size. Rather than develop 
these ideas in the most general context we use them to address the most extreme case 
in which each set Sp of residues is an interval in TLjp'L. The following is our main result. 

Theorem 1.5 (Interval sieve). Let N ^ 1, and suppose that for each odd prime p ^ viV 
one is given an interval Ip C Z/pZ of length {p — l)/2. Let A C {1, . . . , A^} be the 
set obtained by sieving out all n for which n(modp) lies inside Ip for some p. Then 

|A|<A^l/3+o{l)_ 

Remarks. We have concocted this problem so that the theorem is not obviously 
trivial given known results. For the more specific situation in which, for example. 
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Ip = [p/4, 3p/4] is the "middle-half" it follows from results of Jutila |9] that |y4| = 0(1), 
and it is quite possible that this bound could be made rather effective using the results 
of Granville and Ramare ||ij. It seems quite reasonable to suppose that 1^41 = 0(1) 
in the somewhat more general setting of Theorem ll.5[ uniformly in the choice of the 
intervals Ip. 

Our own arguments are fairly routine, but we do import a very interesting result of 
Roberts and Sargos [13] concerning the spacing properties of the set of unit fractions 
{1/x : X ^ X < 2X}. If the argument were written down in a self-contained manner, 
this result would certainly be the beef. As we shall sketch later on, the exponent pairs 
conjecture may be used to obtain the stronger bound |A| <^ N°^^\ 



2. A VARIANT OF THE LARGE SIEVE 

The goal of this section is to prove the following proposition, which is our variant of 
the large sieve. In this proposition a = {an)ne[N] is a sequence of complex numbers and 

we write 

ne[N] 

for r ^ 1. 

Proposition 2.1. Let a = (an)ne[Af] ^^ a sequence of complex numbers. Let X,N, 
1 ^ X ^ N, be parameters. Then 

J2 I E ^nein/x)\' « (Ar + x3)V4xV2+o(i)||^||^^^||^||^_ 

X^x<2X ne[N] 

The same is true with e{n/x) replaced by e{2n/x). 

Remark. When we apply this proposition to the interval sieve problem in the next 
section we will need the exponentials e{2n/x) as well and that is why we have mentioned 
them here. One could use e{kn/x) for any fixed A;, though the implied constants would 
depend on k. 

Proof. The proof of the corresponding inequality when e{2n/x) replaces e{n/x) is 
identical and we say nothing more about it. Let ip : M. ^ M^q be a Beurling-Selberg 
function with the following properties: 

(i) V^(t) ^ 1 for \t\ ^ A^; 

(ii) Tp{Cj is supported on |,^| ^ 1/-^^, where ip{C) := f'^^ip{x)e~^^^ dx; 
(iii) U\U<^N + X\ 



The use of these majorants in analytic number theory is well-known, and the book 
of Montgomery [IT] or the article of Vaaler ^IH] may be consulted for more information 
concerning their construction. 

Now the left-hand side in our proposition may be expanded as 

ne[N] ^^ ' m,x 
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By Holder's inequality this is at most 

I 1 4/3 
ne[-/V] ^^ ' n m,x 

By property (i) of the majorant ip, the first factor is bounded by ||a||4/3. The expression 
inside the second bracket may be expanded as 

"^ ^ amiam2«m3«m4e(-"^i/xi)e(-m2/x2)e(m3/x3)e(m4/x4)x 

X Vv^(n)e(n(— + )), 

^-^ Xi X2 Xs X4 

n 

which is bounded by 

a:i,a;2,X3,a::4 

Here, the hat denotes the Fourier transform on Z, so we are writing 

n 

By the Poisson summation formula we have 

n 

and so by properties (ii) and (iii) oiip we see that (12.11) is at most C||a||f(A^ + X^) times 
the number of quadruples Xi, X2, X3, X4 G [X, 2X) with 

,1 1 1 1 , 1 

— + ^IFs- 

Xi X2 X3 X4 A'^ 

It follows from Theorem 2 of Roberts and Sargos [13j that there are <^ x'^^"^^^ such 
quadruples. The proposition follows quickly. D 

Remark. The reader familar with basic duality theory in Banach spaces may recognise 
some aspects of the proof of Proposition 12. 1[ It is very closely modelled on the proof 
that ||T||4/3^2 = ||T*||2^4, where T : B{X) -^ B{Y) and T* : B(Y) -^ B{X) are 
mutually adjoint operators on spaces B{X),B{Y) of bounded functions. One might 
also regard our variant of the large sieve as a kind of restriction theorem., where one 
looks at the Fourier transform of the sequence {an) restricted to the set of frequencies 
{l/x : X G [X,2X)}. Proposition 12.11 then reflects a kind of "discrete curvature" of 
this set of frequencies, and is closely analogous to such estimates as the Tomas-Stein 
restriction theorem (cf. [H]). 

3. Interval sieve problem 

It is a reasonably straightforward matter to apply Proposition 12.11 to get the stated 
bound for the interval sieve problem. 

Lemma 3.1. Suppose that p is a prime, that Ip C Z/pZ is an interval of length 
{p — l)/2 and that A C [N] is a set such no a E A has a(modp) G Ip. Then either 
ZlnGWi ^A{n)e{n/p) or XlneWl ^A{n)e{2n/p) has magnitude at least \A\/?,. 
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Proof. We note that 1 — 2 cos 6' + cos 26' ^ when \6\ ^ 7r/2; rewriting the left-hand 
side as 2 cos6'(cos6' — 1), this becomes clear. It follows that there is /5 G [0, 1] (depending 
on Ip) such that if n G A then 



11 Tl 

2cos27r(- + /3) + cos47r(- + /5) ^ 0, 
p p 



and hence 



1 - e{p)e{n/p) - e{-(3)e{-n/p) + -e{2(3)e{2n/p) + -e{-2f3)e{-2n/p) ^ 0. 
Summing over n & A and using the triangle inequality, one obtains 

\A\ ^ 2| ^ lA{n)e{n/p)\ + | J^ U(n)e(2n/p)|, 

nG[Af] n£[N] 

from which the result follows immediately. D 

Remark. What we have shown here is that if A (mod p) does not meet Ip then A has 
an extremely large discrete Fourier coefficient in Ti/pT,. In the usual application of the 
large sieve, one shows that the L^-mass of the discrete Fourier transform of A in Z/pZ 
has significant mass away from the zero mode; this is much weaker information and in 
our case is wasteful as it does not utihse the specific additive structure of the excluded 
residues Ip. 

We may now prove Theorem II. 5[ Suppose that A is the set of those elements of 
[A^] which remain after sieving by all residues in Jp, for all p ^ yN . Then, setting 
a„ := lyi(n), the previous lemma implies that 

max(| Y, U(n)e(n/p)|,| J^ U(n)e(2n/p)|) ^ |yl|/3. 

n&[N\ n&[N] 

Note that if a = {an)n&[N] then ||a||i = \A\ and ||a||4/3 = jAI^/'*. Substituting into 
Proposition 12.11 we obtain for any X the bound 

X\A\^ < (iV + X3)V4xV2+o(l)|^|7/4_ 

Taking X = N^^^ leads to the stated bound. D 

Remark. Our argument has something in common with the argument used to obtain 
lower bounds in the Kakeya problem from restriction estimates for the sphere, which 
had its origin in the work of Fefferman [1] . Indeed our sieve bound, phrased differently, 
provides a lower bound oi N — 0{N^^^^^) for a union of sets (Ip + pZ) fl {1, . . . , N} 
which, at a stretch, might be thought of as "lines" in different directions in the spirit of 
the Kakeya problem. These ideas have been considered in a number theoretic context 
before in the work of Bourgain [Ij. 

4. Further remarks 

We conjecture that the following is true. 

Conjecture 4.1 (Spacing of unit fractions). Suppose that r ^ 1 is an integer. Then 
the number of xi, . . . , X2r G [X, 2X) such that 

1 1,1 , , 

^ -T^ (4-1) 



1 


1 


— + ■ 


■■ + — 


Xi 


Xr 



Xr+l X2r X''^^ 



E 
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is<t:r,eX'-+^ for ane>0. 

If this did hold for a particular value of r then a straightforward modification of our 
arguments (using exponents p = 2r/(2r — 1) and g = 2r in Holder's inequality in the 
proof of Proposition 12. ip would lead to a bound \A\ <^ jV7+t+°'-(^) in Theorem 11.51 By 
bounding the number of solutions to (14. ip using a 2r-power moment of exponential sums 
as in p^ one may confirm that the conjecture would follow if we had a bound 

e(e/x)| <Xi/2+°'-W (4.2) 

X^x<2X 

for X^ ^ 1^1 ^ X^'^^. Such a bound is a consequence of the so-called exponent pairs 
hypothesis, stated on p. 214 of [H]. An excellent source of information on exponent pairs 
is the book [5], though the exponent pairs hypothesis itself is conspicuously absent 
from that book. Nonetheless it seems to be fairly widely believed, and in any case 
the bound (14. 2 p accords with the commonly-held belief that exponential sums should 
exhibit square-root cancellation unless there is a "good" reason for them not to. 

Improving the bounds for sup^ |'S'(^)| seems closely related to the Lindelof hypothesis, 
which is equivalent to proving that 

I Yl e(eiogx)| <Xi/2+o.(i) 

X<ix<2X 

for 1^1 <^ X^, for all r. We do not intend this remark to be taken too seriously: it 
stems from the observation that the derivatives of the phase log x are the same as those 
of 1/x, and the derivative structure of a phase is often important in the estimation of 
exponential sums. Nevertheless, it would be very surprising if anything close to (14. 2p 
were proved tomorrow. Perhaps better evidence for this is that an affirmative solution 
to the Dirichlet divisor problem and the Gauss circle problem would follow from (14.20 
for r = 3. See [S], Chapter 4] for more information. 

We note that (14. 2 p is not known for r = 2; Roberts and Sargos bound the 4th moment 
of their exponential sums directly, without obtaining a bound for the supremum of those 
sums. 
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